For the Lexicon that Has Everything

Authors

  • Martha W. Evens
  • Joanne Dardaine
  • Yu-Feng Huang
  • Sun M. Li
  • Judith A. Markowitz
  • Frank J. Rinaldo
  • Margi Rinaldo
  • Robert Strutz
Abstract

This paper argues that it is impossible to separate lexical and encyclopedic knowledge and describes an attempt to build a large lexical database that contains the range of information needed to make a parser or a text generation system interpret and use words and phrases correctly.

1 Introduction

Any natural language processing system needs both knowledge about words and knowledge about the world. Many natural language systems divide these two kinds of knowledge into two knowledge bases, which we call the lexicon and the encyclopedia for the purposes of this discussion. We argue that the distinction between the lexicon and the encyclopedia is difficult to maintain both in theory and in practice.

We describe the design and development of a large lexical database intended to support parsing, generation, and information retrieval applications. We claim that these applications require information of many different kinds, some of which is traditionally stored in a dictionary, some in a thesaurus, and some in an encyclopedia. We need to support not just alphabetic access to this information but access through semantic links.

Bierman [1964] was one of the first to describe lexical-semantic links between words. They define the basic organization of semantic information, he claims. He paints an image of a very large single-page dictionary with language-specific nodes connected by semantic relations.

Can we distinguish between the lexicon and the encyclopedia in this context? Bierwisch and Kiefer [1970] assume that both kinds of information are contained in the same lexical entry.
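The two access paths mentioned in the introduction, alphabetic lookup and traversal of semantic links, can be illustrated with a minimal Python sketch. It is not the database described in the paper: the class name `LexicalDatabase`, its methods, the sample words, and the relation labels ("taxonomy", "part", "child") are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


class LexicalDatabase:
    """Toy lexicon whose entries are reachable alphabetically or via labeled links."""

    def __init__(self):
        # headword -> dictionary of whatever information is stored for it
        self.entries = {}
        # (headword, relation label) -> set of related headwords
        self.links = defaultdict(set)

    def add_entry(self, word, **info):
        # Create the entry if needed and merge in any new information.
        self.entries.setdefault(word, {}).update(info)

    def add_link(self, source, relation, target):
        # Both ends of a link become nodes, even if no other data is stored yet.
        self.add_entry(source)
        self.add_entry(target)
        self.links[(source, relation)].add(target)

    def alphabetic(self, prefix=""):
        # Dictionary-style access: headwords in alphabetical order.
        return sorted(w for w in self.entries if w.startswith(prefix))

    def related(self, word, relation):
        # Thesaurus-style access: follow one kind of semantic link.
        return sorted(self.links.get((word, relation), ()))


# Hypothetical sample data, not taken from the paper.
db = LexicalDatabase()
db.add_entry("dog", pos="noun", definition="a domesticated canine")
db.add_link("dog", "taxonomy", "animal")
db.add_link("dog", "part", "tail")
db.add_link("puppy", "child", "dog")
```

In this sketch a single store holds both the dictionary-style fields and the links between entries, so nothing in the data structure itself marks where "lexical" information ends and "encyclopedic" information begins.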
The distinction between linguistic or lexical and encyclopedic knowledge, they say, corresponds to the difference between the core and the periphery of a lexical entry, where:

    The core of a lexical reading comprises all and only those semantic specifications that determine, roughly speaking, its place within the system of dictionary entries, i.e., delimit it from other (non-synonymous) entries. The periphery consists of those semantic specifications which could be removed from its reading without changing its relation to other lexical readings within the same grammar. [Bierwisch and Kiefer 1970, 69-70]

The major difficulty with this criterion is its instability. As new entries are added to the system, information sufficient to distinguish one entry from another may have to be

Publication date: 1991